Translation | TensorFlow in a Nutshell: All the Models

Posted by: 闹剧-豆腐渣_141 | Source: Internet | 2023-09-14 18:52

Original article:
TensorFlow in a Nutshell — Part Three: All the Models
Original author: Camron Godbout
Translator: edvardhua
Proofreaders: marcmoore, cdpath

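01 Overview

In this article we go through all of the model abstractions currently available in TensorFlow, describing the use cases for each model along with simple sample code. Full working example source code is available at https://github.com/camrongodbout/TensorFlow-in-a-Nutshell.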

Figure: A recurrent neural network.

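02 Recurrent Neural Networks (RNN)

Use cases: language modeling, machine translation, word embedding, text processing.

Since the advent of Long Short-Term Memory (LSTM) networks and Gated Recurrent Units (GRU), recurrent neural networks have advanced rapidly in natural language processing and pulled well ahead of other models. They can be fed vectors representing characters and trained to generate new sentences in the style of the training set. The merit of this model is that it keeps the context of the sentence, so it can derive that "cat sat on mat" means the cat is on the mat. TensorFlow has made creating these networks increasingly simple. More of TensorFlow's hidden features for RNNs are described in Denny Britz's article.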

    # Note: this listing uses the legacy (pre-2.0) TensorFlow API, including tf.contrib.
    import tensorflow as tf
    import numpy as np

    # Create input data: a batch of 2 sequences, 10 time steps, 8 features each
    X = np.random.randn(2, 10, 8)

    # The second example is of length 6, so zero out every step from index 6 on
    X[1, 6:] = 0
    X_lengths = [10, 6]

    # A 4-layer LSTM with dropout applied to each layer's output
    cell = tf.nn.rnn_cell.LSTMCell(num_units=64, state_is_tuple=True)
    cell = tf.nn.rnn_cell.DropoutWrapper(cell=cell, output_keep_prob=0.5)
    cell = tf.nn.rnn_cell.MultiRNNCell(cells=[cell] * 4, state_is_tuple=True)

    outputs, last_states = tf.nn.dynamic_rnn(
        cell=cell,
        dtype=tf.float64,
        sequence_length=X_lengths,
        inputs=X)

    result = tf.contrib.learn.run_n(
        {"outputs": outputs, "last_states": last_states},
        n=1,
        feed_dict=None)
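To make the role of the per-example sequence length more concrete, here is a minimal pure-Python sketch of a one-unit recurrent network. The `run_rnn` helper and its scalar weights are made up for illustration and are not part of TensorFlow. The sketch only mimics the behavior that `tf.nn.dynamic_rnn` provides for padded steps: the state is carried through unchanged and the output is zero.

```python
import math


def run_rnn(inputs, length, w_in=0.5, w_rec=0.8):
    """Run a one-unit tanh RNN over `inputs`, treating steps >= length as padding."""
    state = 0.0
    outputs = []
    for t, x in enumerate(inputs):
        if t < length:
            # The new state depends on the current input and the previous state,
            # which is how context from earlier steps is retained.
            state = math.tanh(w_in * x + w_rec * state)
            outputs.append(state)
        else:
            # Padding step: emit zero and leave the state untouched.
            outputs.append(0.0)
    return outputs, state


padded = [1.0, -1.0, 0.5, 0.0, 0.0]  # the last two steps are padding
outputs, last_state = run_rnn(padded, length=3)
print(outputs, last_state)
```

The returned state equals the output of the last real step, so padding never leaks into the final representation of a shorter sequence.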

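03 Convolutional Networks

Use cases: image processing, facial recognition, computer vision

Convolutional Neural Networks (CNNs) are unique in that they can take raw images directly as input, avoiding complex image preprocessing. A CNN slides a fixed-size window (3×3 in the figure below) over the image from left to right and top to bottom. This window is called the convolution kernel, and at each position of the traversal it computes a convolved feature.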

Figure: A convolution window traversing an image.

We can use convolved features for edge detection, which in turn allows a CNN to describe the objects in an image.

Figure: Edge-detection example from the GIMP manual.

The convolution matrix used for the image above is shown below:

Figure: The convolution matrix from the GIMP manual.
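The sliding-window computation can be sketched in a few lines of plain Python. The kernel below is an assumed Laplacian-style edge-detection matrix chosen for illustration; the exact matrix from the figure is not reproduced in this text. The `convolve2d` helper is hypothetical and uses no padding.

```python
def convolve2d(image, kernel):
    """Slide `kernel` over `image` (no padding) and sum the element-wise products."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    result = []
    for i in range(out_h):          # top to bottom
        row = []
        for j in range(out_w):      # left to right
            total = 0
            for di in range(kh):
                for dj in range(kw):
                    total += image[i + di][j + dj] * kernel[di][dj]
            row.append(total)
        result.append(row)
    return result


# Assumed Laplacian-style edge-detection kernel (illustrative choice).
EDGE_KERNEL = [[0, 1, 0],
               [1, -4, 1],
               [0, 1, 0]]

# 5x5 image: dark on the left (0), bright on the right (9).
image = [[0, 0, 9, 9, 9] for _ in range(5)]
features = convolve2d(image, EDGE_KERNEL)
print(features)
```

The response is nonzero only where the window straddles the dark-to-bright boundary and is zero inside the uniform bright region, which is why such a kernel highlights edges.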

Below is a code sample for recognizing handwritten digits from the MNIST dataset.

    import tensorflow as tf
    from tensorflow.contrib import learn  # legacy (pre-2.0) TensorFlow API

    ### Convolutional network


    def max_pool_2x2(tensor_in):
        return tf.nn.max_pool(
            tensor_in, ksize=[1, 2, 2, 1], strides=[1, 2, 2, 1], padding='SAME')


    def conv_model(X, y):
        # reshape X to 4d tensor with 2nd and 3rd dimensions being image width and
        # height, final dimension being the number of color channels.
        X = tf.reshape(X, [-1, 28, 28, 1])
        # first conv layer will compute 32 features for each 5x5 patch
        with tf.variable_scope('conv_layer1'):
            h_conv1 = learn.ops.conv2d(X, n_filters=32, filter_shape=[5, 5],
                                       bias=True, activation=tf.nn.relu)
            h_pool1 = max_pool_2x2(h_conv1)
        # second conv layer will compute 64 features for each 5x5 patch.
        with tf.variable_scope('conv_layer2'):
            h_conv2 = learn.ops.conv2d(h_pool1, n_filters=64, filter_shape=[5, 5],
                                       bias=True, activation=tf.nn.relu)
            h_pool2 = max_pool_2x2(h_conv2)
            # reshape tensor into a batch of vectors
            h_pool2_flat = tf.reshape(h_pool2, [-1, 7 * 7 * 64])
        # densely connected layer with 1024 neurons.
        h_fc1 = learn.ops.dnn(
            h_pool2_flat, [1024], activation=tf.nn.relu, dropout=0.5)
        return learn.models.logistic_regression(h_fc1, y)

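04 Feed-Forward Neural Networks

Use cases: classification and regression

These networks consist of layers of perceptrons that take inputs and pass information on to the next layer, with the last layer in the network producing the output. There are no connections between the nodes within a given layer. Layers that neither receive the raw input nor produce the final output are called hidden layers.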
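The layer-by-layer flow described above can be sketched without any framework. The weights and the `dense` and `forward` helpers below are hypothetical values and names picked only so that the arithmetic is easy to follow.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def dense(inputs, weights, biases, activation):
    """One fully connected layer: every node sees all inputs, none sees its peers."""
    return [activation(sum(w * x for w, x in zip(row, inputs)) + b)
            for row, b in zip(weights, biases)]


def forward(x, hidden_w, hidden_b, out_w, out_b):
    hidden = dense(x, hidden_w, hidden_b, sigmoid)    # hidden layer
    return dense(hidden, out_w, out_b, lambda z: z)   # linear output layer


# Illustrative weights: 2 inputs -> 2 hidden nodes -> 1 output.
hidden_w = [[0.0, 0.0], [1.0, -1.0]]
hidden_b = [0.0, 0.0]
out_w = [[2.0, 4.0]]
out_b = [1.0]

y = forward([3.0, 3.0], hidden_w, hidden_b, out_w, out_b)
print(y)
```

Both hidden nodes receive a pre-activation of 0 here, so each outputs 0.5 and the final value is 2 * 0.5 + 4 * 0.5 + 1.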

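The goal of this network is similar to that of other supervised neural networks trained with backpropagation: to produce the desired, trained output for a given input. These are some of the simplest effective neural networks for classification and regression problems. The code below shows how easily a feed-forward network can be created to classify handwritten digits: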

    # Note: this listing uses the legacy (pre-2.0) TensorFlow API.
    import numpy as np
    import tensorflow as tf
    from tensorflow.examples.tutorials.mnist import input_data


    def init_weights(shape):
        return tf.Variable(tf.random_normal(shape, stddev=0.01))


    def model(X, w_h, w_o):
        # this is a basic mlp, think 2 stacked logistic regressions
        h = tf.nn.sigmoid(tf.matmul(X, w_h))
        # note that we dont take the softmax at the end because our cost fn
        # does that for us
        return tf.matmul(h, w_o)


    mnist = input_data.read_data_sets("MNIST_data/", one_hot=True)
    trX, trY, teX, teY = (mnist.train.images, mnist.train.labels,
                          mnist.test.images, mnist.test.labels)

    X = tf.placeholder("float", [None, 784])
    Y = tf.placeholder("float", [None, 10])

    w_h = init_weights([784, 625])  # create symbolic variables
    w_o = init_weights([625, 10])

    py_x = model(X, w_h, w_o)

    # compute costs (keyword arguments avoid any ambiguity in argument order)
    cost = tf.reduce_mean(
        tf.nn.softmax_cross_entropy_with_logits(logits=py_x, labels=Y))
    train_op = tf.train.GradientDescentOptimizer(0.05).minimize(cost)  # construct an optimizer
    predict_op = tf.argmax(py_x, 1)

    # Launch the graph in a session
    with tf.Session() as sess:
        # you need to initialize all variables
        tf.initialize_all_variables().run()

        for i in range(100):
            # train on mini-batches of 128 examples
            for start, end in zip(range(0, len(trX), 128),
                                  range(128, len(trX) + 1, 128)):
                sess.run(train_op, feed_dict={X: trX[start:end], Y: trY[start:end]})
            # report test accuracy after each epoch
            print(i, np.mean(np.argmax(teY, axis=1) ==
                             sess.run(predict_op, feed_dict={X: teX, Y: teY})))

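05 Linear Models

Use cases: classification and regression

A linear model takes the X values and produces a line of best fit that is used for classification or regression of the Y values. For example, given the sizes and prices of the houses in a neighborhood, we can use a linear model to predict the price of a house from its size.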
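For the single-feature case, the line of best fit has a closed-form least-squares solution. The sketch below uses made-up house sizes and prices, and `fit_line` is a hypothetical helper rather than a TensorFlow function.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up data: size in square meters, price in thousands.
sizes = [50.0, 80.0, 100.0, 120.0]
prices = [150.0, 240.0, 300.0, 360.0]  # constructed as exactly 3 * size

slope, intercept = fit_line(sizes, prices)
predicted = slope * 90.0 + intercept   # price estimate for a 90 m^2 house
print(slope, intercept, predicted)
```

Because the toy prices lie exactly on a line, the fit recovers a slope of 3 and an intercept of 0 up to floating-point error.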

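Note that linear models can use multiple features. In the housing example, we can build a linear model from house size, number of rooms, and number of bathrooms together with the price, and then use that model to predict the price from the size, rooms, and bathrooms.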

    import numpy as np
    import tensorflow as tf


    def weight_variable(shape):
        initial = tf.truncated_normal(shape, stddev=1)
        return tf.Variable(initial)


    # dataset
    xx = np.random.randint(0, 1000, [1000, 3]) / 1000.
    yy = xx[:, 0] * 2 + xx[:, 1] * 1.4 + xx[:, 2] * 3

    # model
    x = tf.placeholder(tf.float32, shape=[None, 3])
    y_ = tf.placeholder(tf.float32, shape=[None])
    W1 = weight_variable([3, 1])
    y = tf.matmul(x, W1)

    # training and cost function
    cost_function = tf.reduce_mean(tf.square(tf.squeeze(y) - y_))
    train_function = tf.train.AdamOptimizer(1e-2).minimize(cost_function)

    # create a session
    sess = tf.Session()

    # train
    sess.run(tf.initialize_all_variables())
    for i in range(10000):
        sess.run(train_function, feed_dict={x: xx, y_: yy})
        if i % 1000 == 0:
            print(sess.run(cost_function, feed_dict={x: xx, y_: yy}))

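06 Support Vector Machines

Use cases: currently only binary classification

The general idea behind an SVM is that there is an optimal hyperplane for linearly separable patterns. For data that is not linearly separable, a kernel function can be used to transform the original data into a new space. SVMs maximize the margin around the separating hyperplane. They work very well in high-dimensional spaces and remain effective when the number of dimensions is greater than the number of samples.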
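The idea of preferring the hyperplane with the largest margin can be illustrated on a toy dataset. The points, the two candidate hyperplanes, and the helper names below are all hypothetical. The sketch only compares candidates and does not train an SVM.

```python
import math


def decision(w, b, x):
    """Signed score w.x + b; its sign is the predicted class."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def geometric_margin(w, b, points, labels):
    """Smallest signed distance from any point to the hyperplane."""
    norm = math.sqrt(sum(wi * wi for wi in w))
    return min(y * decision(w, b, x) / norm for x, y in zip(points, labels))


def hinge_loss(w, b, points, labels):
    """Average of max(0, 1 - y * score), the loss a linear SVM minimizes."""
    losses = [max(0.0, 1.0 - y * decision(w, b, x))
              for x, y in zip(points, labels)]
    return sum(losses) / len(losses)


points = [(2.0, 0.0), (3.0, 1.0), (-2.0, 0.0), (-4.0, -1.0)]
labels = [1, 1, -1, -1]

# Two candidate separating hyperplanes (hypothetical values).
margin_a = geometric_margin((1.0, 0.0), 0.0, points, labels)  # the line x = 0
margin_b = geometric_margin((1.0, 1.0), 0.0, points, labels)  # the line x + y = 0
loss_a = hinge_loss((1.0, 0.0), 0.0, points, labels)
print(margin_a, margin_b, loss_a)
```

Both candidates separate the data, but the first leaves a wider gap to the nearest points, so a margin-maximizing classifier would prefer it.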

    import tensorflow as tf  # legacy (pre-2.0) API with tf.contrib


    def input_fn():
        return {
            'example_id': tf.constant(['1', '2', '3']),
            'price': tf.constant([[0.6], [0.8], [0.3]]),
            'sq_footage': tf.constant([[900.0], [700.0], [600.0]]),
            'country': tf.SparseTensor(
                values=['IT', 'US', 'GB'],
                indices=[[0, 0], [1, 3], [2, 1]],
                shape=[3, 5]),
            'weights': tf.constant([[3.0], [1.0], [1.0]])
        }, tf.constant([[1], [0], [1]])


    # feature columns: one real-valued, one bucketized, one hashed, one crossed
    price = tf.contrib.layers.real_valued_column('price')
    sq_footage_bucket = tf.contrib.layers.bucketized_column(
        tf.contrib.layers.real_valued_column('sq_footage'),
        boundaries=[650.0, 800.0])
    country = tf.contrib.layers.sparse_column_with_hash_bucket(
        'country', hash_bucket_size=5)
    sq_footage_country = tf.contrib.layers.crossed_column(
        [sq_footage_bucket, country], hash_bucket_size=10)

    svm_classifier = tf.contrib.learn.SVM(
        feature_columns=[price, sq_footage_bucket, country, sq_footage_country],
        example_id_column='example_id',
        weight_column_name='weights',
        l1_regularization=0.1,
        l2_regularization=1.0)

    svm_classifier.fit(input_fn=input_fn, steps=30)
    accuracy = svm_classifier.evaluate(input_fn=input_fn, steps=1)['accuracy']

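07 Deep and Wide Models

Use cases: recommender systems, classification and regression

Deep and wide models were covered in more detail in Part Two, so we will not go too far into them here. A wide and deep network combines a linear model with a feed-forward neural network so that predictions benefit from both memorization and generalization. This type of model can be used for classification and regression problems. It allows for less feature engineering while still giving relatively accurate predictions, and so gets the best of both models. The code snippet below is taken from Part Two.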

    # build_estimator, model_dir, the column-name constants, and the
    # df_train / df_test DataFrames come from Part Two and are not repeated here.


    def input_fn(df, train=False):
        """Input builder function."""
        # Creates a dictionary mapping from each continuous feature column name (k) to
        # the values of that column stored in a constant Tensor.
        continuous_cols = {k: tf.constant(df[k].values) for k in CONTINUOUS_COLUMNS}
        # Creates a dictionary mapping from each categorical feature column name (k)
        # to the values of that column stored in a tf.SparseTensor.
        categorical_cols = {k: tf.SparseTensor(
            indices=[[i, 0] for i in range(df[k].size)],
            values=df[k].values,
            shape=[df[k].size, 1])
            for k in CATEGORICAL_COLUMNS}
        # Merges the two dictionaries into one.
        feature_cols = dict(continuous_cols)
        feature_cols.update(categorical_cols)
        # Converts the label column into a constant Tensor.
        if train:
            label = tf.constant(df[SURVIVED_COLUMN].values)
            # Returns the feature columns and the label.
            return feature_cols, label
        else:
            return feature_cols


    m = build_estimator(model_dir)
    m.fit(input_fn=lambda: input_fn(df_train, True), steps=200)
    print(m.predict(input_fn=lambda: input_fn(df_test)))
    results = m.evaluate(input_fn=lambda: input_fn(df_train, True), steps=1)
    for key in sorted(results):
        print("%s: %s" % (key, results[key]))

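08 Random Forests

Use cases: classification and regression

A random forest contains many different classification trees. Each tree votes on the class of an object, and the class with the most votes is chosen.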
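The voting step can be sketched in plain Python. The three "trees" below are hand-written stand-ins with hypothetical thresholds, not trained models, and `forest_predict` is an illustrative helper.

```python
from collections import Counter


def forest_predict(trees, sample):
    """Let every tree vote and return the most common class."""
    votes = [tree(sample) for tree in trees]
    return Counter(votes).most_common(1)[0][0]


# Hand-written stand-ins for trained trees (hypothetical thresholds on
# petal length and petal width, in centimeters).
def tree_a(s):
    return "setosa" if s["petal_length"] < 2.5 else "versicolor"


def tree_b(s):
    return "setosa" if s["petal_width"] < 0.8 else "virginica"


def tree_c(s):
    return "versicolor" if s["petal_length"] < 5.0 else "virginica"


sample = {"petal_length": 1.4, "petal_width": 0.2}
prediction = forest_predict([tree_a, tree_b, tree_c], sample)
print(prediction)
```

Two of the three trees vote for the same class, so the disagreement of the third tree does not change the result.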

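Adding more trees does not cause a random forest to overfit, so you can use as many trees as you like, and it runs relatively fast. The code snippet below applies a random forest to the Iris flower data set: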

    import numpy as np
    import tensorflow as tf  # legacy (pre-2.0) API with tf.contrib

    hparams = tf.contrib.tensor_forest.python.tensor_forest.ForestHParams(
        num_trees=3, max_nodes=1000, num_classes=3, num_features=4)
    classifier = tf.contrib.learn.TensorForestEstimator(hparams)

    iris = tf.contrib.learn.datasets.load_iris()
    data = iris.data.astype(np.float32)
    target = iris.target.astype(np.float32)

    # monitor the training loss to decide when to stop
    monitors = [tf.contrib.learn.TensorForestLossMonitor(10, 10)]
    classifier.fit(x=data, y=target, steps=100, monitors=monitors)
    classifier.evaluate(x=data, y=target, steps=10)

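09 Bayesian Reinforcement Learning

Use cases: classification and regression

TensorFlow's contrib folder contains a library called BayesFlow. It has no documentation other than an example of the REINFORCE algorithm, which was proposed in a paper by Ronald Williams.

REward Increment = Nonnegative Factor × Offset Reinforcement × Characteristic Eligibility

In symbols, $\Delta w = \alpha \, (r - b) \, e$, where $\alpha$ is the learning rate factor, $r$ the reinforcement value, $b$ the baseline, and $e$ the characteristic eligibility.

This network tries to solve immediate reinforcement learning tasks, adjusting the weights after receiving the reinforcement value at each trial. At the end of each trial, each weight is incremented by the learning rate factor multiplied by the reinforcement value minus the baseline, multiplied by the characteristic eligibility. Williams's paper also discusses using backpropagation to train REINFORCE networks.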
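As a concrete instance of this update, the sketch below applies one REINFORCE step to a single Bernoulli-logistic unit, for which the characteristic eligibility of weight i is (y - p) * x_i. The `reinforce_update` helper and all numeric values are hypothetical and are not taken from BayesFlow.

```python
import math


def reinforce_update(weights, x, y, reward, baseline, lr):
    """One REINFORCE step for a Bernoulli-logistic unit.

    delta_w_i = lr * (reward - baseline) * eligibility_i, where
    eligibility_i = (y - p) * x_i and p = sigmoid(w . x) is the
    probability that the unit emits y = 1.
    """
    p = 1.0 / (1.0 + math.exp(-sum(w * xi for w, xi in zip(weights, x))))
    eligibility = [(y - p) * xi for xi in x]
    return [w + lr * (reward - baseline) * e
            for w, e in zip(weights, eligibility)]


weights = [0.0, 0.0]
# The unit emitted y = 1 for input x and received reward 1 (baseline 0).
new_weights = reinforce_update(weights, x=[1.0, 2.0], y=1,
                               reward=1.0, baseline=0.0, lr=0.1)
print(new_weights)
```

With zero initial weights the unit fires with probability 0.5, so a rewarded output of 1 pushes each weight in the direction that makes that output more likely. A reward below the baseline would push the weights the opposite way.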

    # Excerpt from the body of the model-building function in the BayesFlow
    # example; plus_1, minus_1, st and distributions are defined elsewhere there.
    """Build the Split-Apply-Merge Model.

    Route each value of input [-1, -1, 1, 1] through one of the
    functions, plus_1, minus_1. The decision for routing is made by
    4 Bernoulli R.V.s whose parameters are determined by a neural network
    applied to the input. REINFORCE is used to update the NN parameters.

    Returns:
      The 3-tuple (route_selection, routing_loss, final_loss), where:
        - route_selection is an int 4-vector
        - routing_loss is a float 4-vector
        - final_loss is a float scalar.
    """
    inputs = tf.constant([[-1.0], [-1.0], [1.0], [1.0]])
    targets = tf.constant([[0.0], [0.0], [0.0], [0.0]])
    paths = [plus_1, minus_1]
    weights = tf.get_variable("w", [1, 2])
    bias = tf.get_variable("b", [1, 1])
    logits = tf.matmul(inputs, weights) + bias

    # REINFORCE forward step
    route_selection = st.StochasticTensor(
        distributions.Categorical, logits=logits)

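10 Linear-Chain Conditional Random Fields (CRF)

Use cases: sequential data

CRFs are conditional probability distributions that factorize according to an undirected model. They predict the label of a single sample while keeping context from the neighboring samples. CRFs are similar to hidden Markov models. They are often used in image segmentation and object recognition, as well as in shallow parsing, named entity recognition, and gene finding.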

    # Train for a fixed number of iterations.
    session.run(tf.initialize_all_variables())
    for i in range(1000):
        tf_unary_scores, tf_transition_params, _ = session.run(
            [unary_scores, transition_params, train_op])
        if i % 100 == 0:
            correct_labels = 0
            total_labels = 0
            for tf_unary_scores_, y_, sequence_length_ in zip(
                    tf_unary_scores, y, sequence_lengths):
                # Remove padding from the scores and tag sequence.
                tf_unary_scores_ = tf_unary_scores_[:sequence_length_]
                y_ = y_[:sequence_length_]

                # Compute the highest scoring sequence.
                viterbi_sequence, _ = tf.contrib.crf.viterbi_decode(
                    tf_unary_scores_, tf_transition_params)

                # Evaluate word-level accuracy.
                correct_labels += np.sum(np.equal(viterbi_sequence, y_))
                total_labels += sequence_length_
            accuracy = 100.0 * correct_labels / float(total_labels)
            print("Accuracy: %.2f%%" % accuracy)
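The decoding step finds the tag sequence with the highest total of per-position (unary) scores plus tag-to-tag transition scores. A minimal pure-Python sketch of Viterbi decoding is shown below. The `viterbi_path` helper and the toy scores are made up for illustration and are not the TensorFlow implementation.

```python
def viterbi_path(unary_scores, transitions):
    """Return (best_tag_sequence, best_score).

    unary_scores[t][k] is the score of tag k at position t;
    transitions[i][j] is the score of moving from tag i to tag j.
    """
    num_tags = len(unary_scores[0])
    scores = list(unary_scores[0])
    backpointers = []
    for step in unary_scores[1:]:
        new_scores, pointers = [], []
        for j in range(num_tags):
            # Best previous tag for ending at tag j on this step.
            best_prev = max(range(num_tags),
                            key=lambda i: scores[i] + transitions[i][j])
            pointers.append(best_prev)
            new_scores.append(scores[best_prev] + transitions[best_prev][j] + step[j])
        scores = new_scores
        backpointers.append(pointers)
    # Trace the best path backwards from the best final tag.
    best_last = max(range(num_tags), key=lambda k: scores[k])
    path = [best_last]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path, scores[best_last]


# Toy example with 2 tags and 3 positions (made-up scores).
unary = [[3.0, 0.0], [0.0, 1.0], [2.0, 0.0]]
trans = [[0.0, -2.0], [-2.0, 0.0]]  # switching tags is penalized
path, score = viterbi_path(unary, trans)
print(path, score)
```

Picking the best tag at each position independently would give [0, 1, 0] here, but the transition penalties make staying on tag 0 throughout the better overall sequence. This is the neighboring-sample context that a linear-chain CRF exploits.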

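11 Summary

Since TensorFlow was released, the community around the project has kept adding components, examples, and use cases for the library. Even as this article was being written, more models and sample code were in progress. It is great to see how much TensorFlow has grown over the past few months. The ease of use and variety of its components are increasing, and look set to keep increasing steadily.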
